NLTK: The Natural Language Toolkit
Abstract
The Natural Language Toolkit is a suite of program modules, data sets, tutorials and exercises, covering symbolic and statistical natural language processing. NLTK is written in Python and distributed under the GPL open source license. Over the past three years, NLTK has become popular in teaching and research. We describe the toolkit and report on its current state of development.
Similar resources
arXiv:cs.CL/0205028v1 17 May 2002 NLTK: The Natural Language Toolkit
NLTK, the Natural Language Toolkit, is a suite of open source program modules, tutorials and problem sets, providing ready-to-use computational linguistics courseware. NLTK covers symbolic and statistical natural language processing, and is interfaced to annotated corpora. Students augment and replace existing components, learn structured programming by example, and manipulate sophisticated mod...
Computational Semantics in the Natural Language Toolkit
NLTK, the Natural Language Toolkit, is an open source project whose goals include providing students with software and language resources that will help them to learn basic NLP. Until now, the program modules in NLTK have covered such topics as tagging, chunking, and parsing, but have not incorporated any aspect of semantic interpretation. This paper describes recent work on building a new sema...
Multidisciplinary Instruction with the Natural Language Toolkit
The Natural Language Toolkit (NLTK) is widely used for teaching natural language processing to students majoring in linguistics or computer science. This paper describes the design of NLTK, and reports on how it has been used effectively in classes that involve different mixes of linguistics and computer science students. We focus on three key issues: getting started with a course, delivering i...
Recursos en euskera para la herramienta NLTK para enseñanza de procesamiento del lenguaje natural (Basque Resources for the NLTK Tool for Teaching Natural Language Processing) [in Spanish]
We present the resources we have adapted to enable the NLTK package to handle text in Basque.
Criando um corpus sobre desastres climáticos com apoio da ferramenta NLTK (Creating a Corpus about Climate Disasters with the Support of the NLTK Tool) [in Portuguese]
This work is part of a broader research that explores information from a corpus of news about climate disasters and automatically recognizes, with the support of a tool for Natural Language Processing (NLP), words that denote the main actors involved and their actions in providing relief to victims. It starts with the hypothesis of Steinberger [2005] that news reports of disasters not only allo...
Journal title:
- CoRR
Volume: cs.CL/0205028, Issue: -
Pages: -
Publication date: 2002